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Abstract 

We consider an alternative derivation of the GSO Projection in the free fermionic 
construction of the weakly coupled heterotic string in terms of root systems, as well as 
the interpretation of the GSO Projection in this picture. We then present an algorithm 
to systematically and efficiently generate input sets (i.e., basic vectors) in order to study 
Landscape statistics with minimal computational cost. For example, the improvement 
at order 6 is ~ 10~ 13 over a traditional brute force approach, and improvement increases 
with order. We then consider an example of statistics on a relatively simple class of 
models. 
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1 Free Fermionic Models and the String Landscape 



During the decade following the first string revolution, the focus of string phenomenology 
was two-fold: development of methods for constructing consistent string models with com- 
pactified dimensions and searching within the domain of each construction method for those 
models with at least quasi-realistic phenomenology [Tj . Many three generation string models 
were found within a few years. The dominant view back then was that discovery of the 
true string vacuum (model) was within reach-that the string "E needle" would eventually 
be found within the stringstack of but a few trillion vacua. Eventually a handful of quite 
realistic MSSM or Near-MMSM three generation models were indeed found [3J, especially 
following the first Minimal Standard Heterotic String Model [I] . 

However, following the second string revolution the rise of M-theory has taught string 
phenomenologists the likely impossibility of finding a "true" string vacuum somewhere on the 
string/M-landscape composed of at least 10 500 vacua. In addition to vast number of vacua, all 
vacua now appear to be on equal footing. Thus, the phenomenological goal has shifted from 
studying individual string models to better understanding statistically the characteristics of 
the string/M-models on the landscape, or at least within specific domains [5]. 

The (often overlapping) domains on the landscape frequently correspond to model con- 
struction methods. One construction method that has been widely explored in terms of 
individual models and for which large scale statistical studies are underway is free fermions 
[HI U\. The free fermionic heterotic string has provided many quasi-realistic (Near-)MSSM- 
like models EJ [TU], (semi-) GUT models [TJJ [T2], and GUT models [T3]. In the context 
of the second string revolution, we should now proceed to determine overall pheonenolog- 
ical patterns within these and the many more, as yet undiscovered, free fermionic models. 
Therefore, let us review the free fermionic approach as a means of developing a systematic 
method of generating models in vast numbers from which the statistics of phenomenological 
properties may be developed. 



2 The Standard GSO Projection 

The first object required to specify a model in the free fermionic heterotic string [61 [7] is 
a set 

A = {c? e Q 64 n (-1, l] 64 |z G {1, L G N}} (2.1) 

where components a), j = 1, . . . , 20 are boundary conditions for real worldsheet free fermion 
degrees of freedom of the left-moving supersymmetric string, and a* for j = 21, . . . , 64 are 
boundary conditions for real worldsheet free fermion degrees of freedom on the right moving 
bosonic string. In the C basis (complex fermions), each component of a 1 is double counted, 
and cP is a 32 (10 + 22) component vector (which can be generalized to include left-right 
paired real fermions). 

The order Ni of a given a % is defined a^] 

Ni = mm{m G N | ma) = mod 2 Vj}, (2.2) 

*We include "0" in the set of natural numbers N as in set theory, in contrast to its general exclusion in 
number theory. 
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with 



% = LCM(iV i ,iV i ). (2.3) 
Thus, each component of an a of order N is of the form 

(3), e ^ n [-1, l) (2.4) 

Modular Invariance demands 

Ni ot ■ 6t = mod 8 (2.5) 

Nij ct ■ a j = mod 4 (2.6) 

in the C basis j6j|7]ji] One additional requirement is that each model contain a 1 = I, the 64 
real-component vector with every element equal to one. 

In a given free fermionic model, the different sectors of a model are formed by all linear 
combinations of the c?'s with coefficients m k e N where each coefficient m k < iVj. Each 
linear combination, or sector, is denoted 

L 

V k = Y,™ k a l (2.7) 

i=i 

We can (and will) think of each set of coefficients m k as an L-dimensional vector in N L , 
whose i th component is constrained by the order of a 1 . 

For a given sector V k , a worldsheet fermion fj transforms as 

fj -> -<'^J) (2.8) 

around non-contractible loops on the worldsheet. Thus, for R fermions, V k must be either 
or 1, whereas for C fermions, V k must be rational. 

For each sector, we can form the U(l) charges for the Cartan generators of the unbroken 
gauge groups (which are in one to one correspondence with the U(l) currents f* fj for each 
complex fermion fj); 

Q^ k = l -V k + F k (2.9) 

where F k is a fermion number operator which counts each mode of fj once and of /* minus 
once. Or, in other words, F k E { — 1, 0, 1} V 1 

The second object required to specify a model in the free fermionic heterotic string is an 
L x L matrix kij. Modular Invariance imposes the following constraints on fc^-; 



kij + kji = ■ a j mod 2 (2.10) 



^Further, any set of three basis vectors a" 1 , a 3 , and a k , including cases where i, j, andor k may be identical. 
However, this requirement is automatically satisfied by gauge sector basis vectors of the form discussed herein, 
so this contraint will not be discussed more in this paper. 
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hi + hi — ^cf ■ a 1 — Si mod 2 



(2.11) 



and 



Njkij = mod 2 



(2.12) 



where Sj is the 4 dimensional spacetime component of a 1 . Furthermore, we demand hj G 

(-Ml- 

In this paper, we will concentrate on the (massless) gauge sectors, so we will assume Sj=0 
and a\ = for i = 0, . . . , 20 in everything that follows. Furthermore, we can express the 
masses of physical states as functions of the charges; 



/ 2 1 i A \2 1 

a^icft = jWMdi) - 2 



(2.13) 
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a bright = ^(Qy^ght) ~ 1 ( 2 - 14 ) 
We have already demanded that mi e ft = above. So, to make m ri ght = 0, we demand 



(Q 



V fc ,left^ 



Now, the GSO projection constraint is; 

L 



® % ■ Qyk = ^2 m nhn + Si mod 2 



(2.15) 



(2.16) 



n=l 



with Si = in our models. 



3 Model Building and the Weyl Conditions 

The formulation described in section [2] will produce a set of states which form an abstract 
root system of simply laced type. We therefore begin by considering the connections between 
the approach outlined in section ijand the Weyl Constraints: A set $ of vectors Q l with norm 
squared 2 form a root system of simply laced type iff they satisfy the following constraints 

0; 



Q\ Q j e $ 
Q\ Q j e $ 



Q< ■ Q ] e Z 



(3.1) 
(3.2) 
(3.3) 



From equations (2.7) and (2.9), it is clear that, given 

Q' 



(3.4) 



k=l 
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we can define Q n by replacing F* with — F % and m\ with — m l k , so that 



1 L 1 L L 

q 1 + q" 1 = ^ X m ^ fc + pl + o X^ - m ^" fc - = X^ 3 * = 



(3.5) 



fc=i 



fe=i 



fe=i 



So Q n = —Q % . It is clear that Q n will sati sfy M odular Invariance and the GSO Projection 
as long as Q l does, and therefore equation (3.1) is satisfied. 

Satisfying equation (3.3) is a bit trickier. We can make it more transparent by expanding 
out only one of the roots, 



Q XX** + ■ <y e z 



i l 



mla k .Qi = a -F l - Q J 



(3.6) 



k=l 



for some o£Z. From equations (2.9) and (2.4) we can write the general form of F l ■ Q 1 as 

L 



F % ■ Q 3 



IE 



k=l 



mia 



r ■ F i ■ r X '"l- • • 



j bik 



k=l 



(3.7) 



where bn~,c G Z. So, (3.6) now gives 

2 



^ L L 



m 



(3. 



k=l k=l 

where d is now some integer which we can effectually ignore, leaving 

L L 



Q 3 = J2< 



ik 



k=l 



k=l 



(3.9) 



Without loss of generality, we can take the m' k } s on the left hand side of (3.9) to have a sing^ 
non-zero unit element. So, defining k^ = ^r~, we have that in order to satisfy equation ( 3.3b 



a k ■ Q 3 = X mlhk 



k=l 



where kjt. satisfies 



N k k ik = mod 2, 



(3.10) 



(3.11) 



which is equivalent to what was stated in section [2] In other words, we have derived the 
general form of the GSO Projection merely by imposing the constraint that our formalism 
(cf. equation (2.9)) result in an abstract root system of simply laced type. 
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4 Towards Comprehensive Landscape Statistics 



A difficulty in collecting comprehensive statistics in the free fermionic approach (in ad- 
dition to the sheer size of this domain of the landscape) is in finding an efficient means of 
systematically generating sets of a's obeying equations ( |2.5 ) and (2.6). To date, searches 



within the landscape of free fermionic heterotic models have been, although wide-ranging, 
have used random sampling |14| . One difficulty with regard to random sampling within the 
free fermionic portion of the landsacape, is the issue of floating correlations, as discussed in 
|15j . This problem was shown in [15] to be endemic to random statistical landscape searches 
and reflects the fact that not all physically distinct string models are equally likely to be 
sampled in any random search through the landscape or within a specific subspace. This 
can result in statistical correlations of phenomenological properties of models that "float" as 
a function of sample size. 

While several possible methods were proposed in [15] to overcome this problem, an al- 
ternative is to devise a search method that is not random. The remainder of this paper is 
an exposition of a method of doing exactly this. As the simplest example, we will consider 
herein the set of Layer 1, Order 2 calculations. Since the primary purpose of this paper is to 
outline our general method, a vastly more detailed systematic search, based on the approach 
introduced herein, will be the subject of a paper to soon follow [16J. 

Not counting the I vector, if we have a Layer L set {<?}, i = 1,...,L, with Orders 
Ni, N 2 , . . . , Nl, a brute force approach to finding all Modular Invariant sets requires calcu- 
lating dot-products for ( Yli=i N) candidate vectors, demanding a total of 

22 



i=l 



L\ (\{Ni) (4.1) 



calculations, keeping track of valid sets along the way. This obviously becomes intractable 
even for very low Orders and Layers. Layer one and Order two (which we will denote L102) 
will demand « 4 x 10 6 , while L104 demands ^2x 10 13 . This can be simplified by putting 
the elements in order in a 1 . Then, under each block of similar elements in a 2 , separately 
put each element of a 2 in order. This can be repeated for each successive a. While this 
is extremely helpful for small L, the need to order elements separately for each block of 
elements above quickly reduces the usefulness of this for higher Layer. 

We can greatly simplify this problem by breaking the set of cfs into a tensor product 
of points in a lower dimensional space. We elaborate through examples. For an a of, say, 
order 2, the possible elements are 1 and 0. We therefore specify a given order 2 a by a single 
number n\, which specifies the number of l's. The l's are assumed to be moved to the left, 
so that the 22 — n 2 0's are to the right. For example, n\ = 4 would specify the vector 

a = (1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) (4.2) 

The dot-product is easy to calculate; a 2 = n\. 

For order 3, the possible elements are ±2/3,0. So, we specify an order 3 a with two 
numbers; n\ corresponding to 2/3, and n 2 corresponding to —2/3. In this case, a 2 = 
§(ni + n 2 ). 
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This generalizes in the obvious way to higher order. Higher Layer, however, is a bit more 
subtle. Consider L2023 (Layer 2, where the first a is order 2 and the second is order 3), the 
possible columns when the a's are placed one above the other are 



1 \ ni / 1 \ n2 f]\ n:i / \ ™ 4 / x v ' 



2/3) ' V— 2/3 y '\0j 'V 2 /3/ ' V~ 2 / 3 
So, the dot products in this case will be 



(4.3) 



(a 1 ) 2 = n 1 +n 2 + n 3 

(a 2 ) 2 = ^(ni + ra 2 + n4 + n 5 ) 
2 

a 1 -a 2 = -(n 1 -n 2 ) (4.4) 

For a Layer L set with orders Ni, there will be Y[i=i(Ni) — 1 different rij's. Thus, since 
we know that 

nf =1 W)-i 

and, defining 

L 

A = HiNi) - 1, (4.6) 

i=l 

we find that the total number of calculation which must be performed in this approach is 

^ II( 22 + ( 4 - 7 ) 

' 3=1 

The improvement of this approach is therefore 

nt l( 22 +J ) 



A!(A + 1) 22 ' 

The improvement depends on the total Order, not on the Layer. To see this more clearly, 
we provide a graph of equation (4.8) through A = 500. 
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The improvement is greater for higher order. This is good news because there are no 



ordering conventions which help simplify in the original approach (equation 4.1) for large 
Layer (« L > 4), which is where larger orders typically come from. Recall that at LlOA, 
equation (4.1) demanded ~ 2 x 10 13 calculations. Equation (4.7), however, demands only 
2, 300. Furthermore, because we can now express any and all dot products as simple lin- 
ear combinations of the rij's with rational coefficients, each calculations is straightforward 
addition, rather than a dot-product of a 22 component vector. 

We now show how to further reduce the complexity of the problem by simplifying the 
imposition of Modular Invariance. To begin, in writing out the content of each rii (as in 



equation (4.3)), we will always put the elements in order of decreasing magnitude, starting 



with the positive elements and then negative. For example, L106 would be written as 



n-.i 



2\ ™4 

3 



(4.9) 



We then write a 2 = n\ + |n 2 + \n 3 + + \n 5 . So, Modular Invariance (2.5) demands 



,4141 

Of r = 6 ( ni + -n 2 + -n 3 + -n 4 + -n 5 



- mod 8 
9rii + An 2 + n 3 + 4n 4 + n 5 = mod 12. 



(4.10) 



This is a simple linear equation which can be imposed easily at each step. However, we can 
impose this a priori as follows. For an order iV a (we assume L = 1 for now) with the proper 



ordering (as in (4.9)), the elements of a will come from equation (2.4). If we demand that 
riA satisfy 



n A E - tfrij + 2NZ, 



(4.11) 



where aj is e Z and specifies the element of a corresponding to rij according to (2.4). In 



other words, ctj is an integer such that ^ £ [—1,1) is the element of a corresponding to rij. 
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For example, in (4.9), 



ai = 3, a 2 = 2, a 3 = 1, etc. 



(4.12) 



Our ordering convention guarantees that this will always be an integer. 

For L ^ 1, we can impose this constraint on each a separately by demanding that the 
rii corresponding to the last non-zero element of each a satisfy this constraint, with the 
sum going from 1 to the previous rii. Furthermore, our ordering of the n^s ensures that 
an rii corresponding to a final non-zero element will be unique. Demanding this for each m 



corresponding to the last non-zero element in each a will ensure that (2.5) is always satisfied 
for each a in the set. 

We can do something similar to impose (2.6). For the dot-product between a % and a\ 
let rik correspond to the last element which contributes to a 1 ■ oP . For example, in (4.3) 
this would be n 2 . Once again, our ordering convention for the m's ensures that each will be 
unique. If we impose 

k ~ 1 N-N- 
n k G — a ji>j n 'j H — ^r^-Z, (4-13) 



where a,- is the integer defining an element of a 1 and bj is the integer defining an element of 



a? , then (2.6) will be satisfied. 



To see this, consider L30223 as an example. We will have 





rii 






1 1 




2/3/ 




V-2/3 






f * 




1 1 




2/3/ 




V-2/3 




We then impose (4.11) on n n , n 9 , and n 6 , giving 

n u E 6Z — (m + n 2 + n 4 + n 5 + n 7 + n 8 + mo) 
n 9 E 4Z — (m + n 2 + n 3 + n 7 + n 8 ) 
n 6 E 4Z - (m + n 2 + n 3 + n 4 + n 5 ). 

So, we have (for example) 

= 2(m + n 2 + n 3 + n 4 + n 5 + n 6 ) 

= 2(m + n 2 + n 3 + n 4 + n 5 + 4Z - (m + n 2 + n 3 + n 4 + n 5 )) 
e 8Z = mod 8. 



(4.14) 



1\2 



2(« X ) 



(4.15) 



Similarly, 



3(a 



2\2 



— 3 • - (m + ^2 + n 4 + n 5 + n 7 + n 8 + mo + ™n) 

y 

4 

= 3 • - (m + n2 + ri4 + ns + n 7 + n 8 + mo + 6Z 
9 

— (m + n 2 + n 4 + n 5 + n 7 + n 8 + mo)) 
G 8Z = mod 8. 



(4.16) 
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Next, we impose (4.13) on rig, n 5 , and n 3 . 



n 4 6Z - (n\ — n 2 ). 



(4.17) 



Thus, 



6a 1 • 



a ' 



01 -ni 



2 2 

3 n2 + 3" 4 



= 4(ni — n 2 + (Z — (ni - n 2 )) 
G 4Z = mod 4. 



(4.18) 



The constraints on n 8 and n 5 are similar. 

Imposing Modular Invariance in this manner allows us to effectively generate acceptable 
input sets directly, sparing us the computational difficulty of checking huge numbers of 
unacceptable sets to find the ones we want. So, by designing an algorithm which expresses 
the sets of cf s in terms of rij's, and imposing (4.11 ) and (4.13 ), we can systematically generate 



comprehensive input sets at arbitrary Order and Layer, with a great deal of the work being 
done up front (imposing (4.11) and (4.13)). It remains to discuss how these sets are to be 



analyzed to generate statistics (including consideration of models with initially matching 
gauge groups, that may or may not lead to possibilities for different models once matter 
sectors are added). This we leave for discussion in our upcoming paper. 



Before moving on, we make one final observation. By combining (4.7), (4.11), and (4.13) 



we can write down the approximate number of of sets which satisfy Modular Invariance for 
a given L, {Ni}, i = l,...,L, and A = l\f=i( N i) ~ 



L i-l 

i=i j= 



r(23 + A) 



2 L A\(A + l) L r(23) 



(4.19) 



This expression clearly increases monotonically for increasing Layer and Order, implying an 
infinite number of valid sets. However, it is likely that past a certain level of complexity the 
models will become redundant, providing an effective upper limit. 

As an extremely elementary example of how this works, we consider the simplest possible 
example, L102. There is only one rij, and (4.11) simplifies to ri\ G 4Z, and equation (4.19) 
gives 5.75, which is a reasonable estimation of the obvious 5 solutions, ri\ G {4, 8, 12, 16, 20} 
(we exclude ni = for obvious reasons). Each of these models can be easily analyzed, giving 
Table 1. 



Table 1: The complete set of gauge group models of LI 02 class. 



m 


Gauge Group 


4 


SO(U) 


8 


SO (28) x E 8 


12 


SO(20) x SO(24) 


16 


50(12) x 50(32) 


20 


50(40) x SU(2) x SU(2) 
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Using this approach, statistics at very high Layer and Order can be produced fairly 
easily. The results of such an analysis will be discussed in much more detail in an upcoming 
paper [16]. We also wish to emphasize that the approach we have introduced here can be 
generalized to include matter sectors as well, as will be demonstrated in [T7] . 



5 Solving for Alpha 



As a final comment, we can rewrite (2.16) as 



a' 



(5.1; 



n=l 



for some set a 



Then, using (2.10) and (2.11), we can write the Cartan Matrix of a given model as 

L 

C jk = Q yj ■ Qy k = (fcf + 3)mX + k n m l m n + <l>jh + {m j ) T ■ a k + {m k ) T ■ a j , (5.2) 

n=2 



where the vector k R is a vector with components equal to the diagonal elements of kij, a k 
is the vector with i th element a^, and <pjk = ■ F k . Using this, the left hand side can be 
defined for a specific gauge group, and then the right hand side can be solved as a system of 
coupled quadratics over positive integers. The interesting thing is that the right hand side 
has no a dependence. A solution to this equation for a given k R , 4>jki and a k will provide a 
set of constraints for dps which give the gauge group defined by the desired Cartan Matrix. 
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